The Odd One Out: Identifying and Characterising Anomalies
نویسندگان
چکیده
In many situations there exists an abundance of positive examples, but only a handful of negatives. In this paper we show how in binary or transaction data such rare cases can be identified and characterised. Our approach uses the Minimum Description Length principle to decide whether an instance is drawn from the training distribution or not. By using frequent itemsets to construct this compressor, we can easily and thoroughly characterise the decisions, and explain what changes in an example would lead to a different verdict. Furthermore, we give a technique through which, given only a few negative examples, the decision landscape and optimal boundary can be predicted—making the approach parameter-free. Experimentation on benchmark and real data shows our method provides very high classification accuracy, thorough and insightful characterisation of decisions, predicts the decision landscape reliably, and can pinpoint observation errors. Moreover, a case study on real MCADD data shows we provide an interpretable approach with state-of-the-art performance for screening newborn babies for rare diseases.
منابع مشابه
Interpretation of gravity anomalies via terracing method of the profile curvature
One of the main goals of interpretation of gravity data is to detect location and edges of the anomalies. Edge detection of gravity anomalies is carried out by different methods. Terracing of the data is one of the approaches that help the interpreter to achieve appropriate results of edge detection. This goal becomes a complex task when the gravity anomalies have smooth borders due to gradual ...
متن کاملAlteration in incidence and pattern of congenital anomalies among newborns during one decade in Azarshahr, Northwest of Iran
Background and aims: Congenital anomalies are as the major causes of stillbirths, neonatal death, disability and childhood health problems all over the world. The aim of this study was to determine the incidence and pattern of congenital anomalies in newborn during the first 24 hours of life in Shahid-Madani hospital, Azarshahr, Tabriz, during two periods 2002-2003 and 2...
متن کاملThat s Odd! How Scientists Respond to Anomalous Data
We use an in vivo methodology to investigate the responses of scientists to anomalies. Protocols of 3 scientists performing data analysis in 2 domains were analyzed. We found that the scientists noticed anomalies and paid more attention to them than to expected data. This attention took the form of proposing a hypothesis and then elaborating that hypothesis by reference to other data in the vis...
متن کاملUse of a Two-Channel Moiré Wavefront Sensor for Measuring Topological Charge Sign of the Vortex Beam and Investigation of Its Change Due to an Odd Number of Reflections
One of the solutions of the Helmholtz equation is the vortex beams. In the recent decades, production and applications of these types of beams have found serious attentions. Determination of the vortex beam topological charge and its sign are very important issues. Odd number of reflections of the vortex beam changes its vorticity. In this paper, we have used a q-plate to generate a vortex beam...
متن کاملPrevalence Rate of Congenital Anomaly of Male Newborn in Fasa Hospital
Background & Objectives: Congenital anomalies of external genitalia are one of the most frequently congenital anomalies especially in the boys neonates, that in many cases no definite cause was found for them. Because of having basic knowledge will help our for early diagnosis planning, early treatment and decreasing psycho-social problems of these patients and their parents, we decided to carr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011